Comparison of full-text searching to metadata searching for genes in two biomedical literature cohorts

نویسندگان

  • Bradley M. Hemminger
  • Billy Saelim
  • Patrick F. Sullivan
  • Todd J. Vision
چکیده

also is significantly lower than that of metadata searching. Certain features of articles correlated with higher relevance ratings. A significant feature measured was the number of matches of the search term in the full-text of the article, with a larger number of matches having a statistically significant higher usefulness (i.e., relevance) rating. By using the number of hits of the search term in the full-text to rank the importance of the article, performance of full-text searching was improved so that both recall and precision were as good as or better than that for metadata searching. This suggests that full-text searching alone may be sufficient, and that metadata searching as a surrogate is not necessary. Introduction Traditionally, most researchers have searched for scholarly information through bibliographic databases which match search keywords against the metadata that describes the content, with journal articles being the most common form of content (Hersh et al., 2006). Examples of commonly used bibliographic databases include PubMed and the ISI Web of Knowledge. The metadata description serves as a surrogate for the complete article itself. With the advent of electronic (i.e., digital) versions of articles being available, there has been an increased interest in searching the complete, or " full-text, " article itself. Many publishers are beginning to support full-text searching of their online content (e. found that the vast majority of people (89%) turn to search engines to initiate their searches for information while few use library Web pages (2%) or online databases (2%). Even academic research scientists prefer search engines over library Web pages for their information searching for research purposes (Hemminger, 2005, 2007) and are increasingly turning to meta-search interfaces such as Google Scholar to perform full-text searches. Several factors have led to the success of full-text tools such as Google Scholar: having a single simple search interface covering all resources (meta-search), the increasing amount of scholarly material available on Web pages or through resources made available to search engines, instant results and the ability to access the final content via a single click, and the utility of full-text searching versus metadata searching. This article is concerned with the latter issue—understanding in more detail how full-text searching compares with metadata-based searching of scholarly literature. While it is clear that full-text matches of search strings yield more matches than does just searching for matches within the metadata of articles, it is not evident how many more …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of full-text versus metadata searching in an institutional repository: Case study of the UNT Scholarly Works

Authors in the library science field disagree about the importance of using costly resources to create local metadata records, particularly for scholarly materials that have full-text search alternatives. At the University of North Texas (UNT) Libraries, we decided to test this concept by answering the question: What percentage of search terms retrieved results based on full-text versus metadat...

متن کامل

Semantic Web Technologies for a Knowledge Base of Biomedical Facts Extracted from Scientific Literature

Biomedical literature, including scientific articles, public health reports and books become more and more available due to massive digitalization. Exploration and analysis of this rich source of data requires assistance of automatic tools capable of dealing with large volumes of text. We are developing a pipeline for processing publicly available biomedical text, abstracts, full text articles,...

متن کامل

مطالعه مروری بررسی تاثیرات استفاده از باند نواری کینزیولوژیک در ناحیه عضله تراپز فوقانی بر روی اختلالات اسکلتی عضلانی یک چهارم فوقانی بدن.

abstract Background: we undertook a literature review to produce evidence-based recommendation for the kinesio tape of upper trapezius muscle in the musculoskeletal disorders of upper quarter region. Data source: a full literature electronic search was performed using google scholar ,pubmed, science direct, proquest, medline, advanced google and pedro database. The following keywords w...

متن کامل

Full-text Search for Thai Information Retrieval Systems

While there have been a lot of efficient full-text search algorithms developed for English documents, these algorithms can be directly used for other languages, e.g. Chinese, Japanese, Thai and so on. However, due to idiosyncrasies of each individual language, directly applying such algorithms may not be suitable for the language considered. This paper proposes a simplification of Boyer-Moore a...

متن کامل

Some New Properties of the Searching Probability

Consider search designs for searching one nonzero 2- or 3-factor interaction under the search linear model. In the noisy case, search probability is given by Shirakura et al. (Ann. Statist. 24(6) (1996) 2560). In this paper some new properties of the searching probability are presented. New properties of the search probability enable us to compare designs, which depend on an unknown parameter ?...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JASIST

دوره 58  شماره 

صفحات  -

تاریخ انتشار 2007